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Abstract 

As quantum information science approaches the goal of constructing quantum computers, under- 
standing loss of information through decoherence becomes increasingly important. The information 
about a system that can be obtained from its environment can facilitate quantum control and error 
correction. Moreover, observers gain most of their information indirectly, by monitoring (primarily 
photon) environments of the "objects of interest." Exactly how this information is inscribed in 
the environment is essential for the emergence of "the classical" from the quantum substrate. In 
this paper, we examine how many-qubit (or many-spin) environments can store information about 
a single system. The information lost to the environment can be stored redundantly, or it can 
be encoded in entangled modes of the environment. We go on to show that randomly chosen 
states of the environment almost always encode the information so that an observer must capture 
a majority of the environment to deduce the system's state. Conversely, in the states produced by 
a typical decoherence process, information about a particular observable of the system is stored 
redundantly. This selective proliferation of "the fittest information" (known as Quantum Dar- 
winism) plays a key role in choosing the preferred, effectively classical observables of macroscopic 
systems. The developing appreciation that the environment functions not just as a garbage dump, 
but as a communication channel, is extending our understanding of the environment's role in the 
quantum-classical transition beyond the traditional paradigm of decoherence. 
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I. INTRODUCTION 



In recent years, the field of quantum information science has grown tremendously. Quan- 
tum cryptography is rapidly becoming the domain of engineers more than of physicists; 
quantum optics and nanotechnology are making quantum systems more and more directly 
accessible; and quantum computers are seen not as a pipe dream, but as the next century's 
inevitable technology. 

In this context, many scientific questions of fundamental interest have donned new hats 
as pressing technological issues. The transition from quantum to classical, a subject that 
used to be the field of philosophers and quantum foundationalists, is suddenly a critical area 
of research - where is it, and how does it happen? If we are to design quantum hardware, 
then one of the primary failure modes is precisely its transition to classical behavior. The 
process by which this transition occurs is decoherence, and decoherence is accordingly more 
interesting than ever today. 

That quantum environments interact with quantum systems and, by doing so, decohere 
them, is by now widely appreciated. Until recently, models of decoherence usually (from a 
conceptual standpoint) stopped at that point; the challenge was to identify (a) what the 
environment was doing to the system, and (b) what was the resulting evolution of ps, the 
state of the system. The environment obviously gained information about the system - 
effectively measuring it - but what happened to that information once it had been sucked 
away was largely irrelevant. The standard ansatz of decoherence analysis reflects this: from 
the state of the entire universe, one obtains the system's state p$ by tracing out the 
environment S. Once £ is traced over, it vanishes from the scene. 

Recently, the environment has started to become interesting again The 
most obvious feature that distinguishes the classical realm from the quantum is objectivity. 
Unknown classical states can be found out without being disturbed, so they are said to 
"exist objectively." This is, of course, not the case for quantum states. Thus, as it has 
often been emphasized by Asher Peres , the nature of a quantum state is a much 
more elusive thing. In quantum systems (at least isolated ones) such states cannot be 
regarded as existing objectively. As was pointed out some time ago [8], however, observers 
usually obtain information about a system by measuring its environment. Therefore, the 
manner in which the environment stores and transmits information is of great interest. In a 
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sense, the environment is not just a wastebasket (where quantum information gets thrown 
away), but rather a communication channel through which observers find out about their 



universe. In this "environment as a witness" view of 



;he transition from quantum to classical, 
5] with which information is inscribed 



objectivity is a consequence of the redundancy nana 

in S. The environment has also become interesting because a greater understanding of 
the entire decoherence process can facilitate the protection of fragile quantum information. 
Quantum control, for instance, is often based on "second hand" information obtained from 
the environment. On the other hand, it's much easier to recover all of the lost quantum 
information when a single copy is stored in a local environment than when trillions of copies 
have been recorded throughout the universe. 

In this paper, we skip almost completely over the traditional approaches to decoherence, 
and examine instead some very general properties of quantum states. We ignore the processes 
that produce states in which the environment has information about the system, analyzing 
instead the way in which that information is recorded in the environment. Our "toy" universe 
consists of a collection of qubits (or spin-| particles), one of which is the system S. We begin 
by asking a very simple question: When the whole environment has got some information 
I about the system, how much of that X can be found out by measuring a single qubit, or 
two qubits, or m qubits? We ask this question about arbitrary states of the universe, and 
then we ask it about a particular sub-ensemble of states. 

We conclude that our experience of the "real classical universe" is consistent with states 
that record certain information redundantly at the expense of complementary information. 
Such states appear naturally in the process of decoherence. Redundantly recorded observ- 
ables of the system are in effect objective; they can be independently deduced by many 
observers from the state of 8 that was responsible for decoherence . Finally, we 

examine the concept of redundant information storage directly, and relate this quantitative 
measure of redundancy to our more qualitative tools for analyzing joint system-environment 
states. 

II. LAYING OUT THE PLAYING FIELD 

In any discussion of decoherence, we have to start by defining a system (S) and an 
environment {£). In order to go further and examine the redundancy of information storage, 
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we also need to partition £ into sub-environments £\ . . .£n- For this entire discussion, then, 
our universe will be an (N + l)-qubit Hilbert space H = H S ® (gCi H £i , where both H s 
and all the Ti,£. are single qubits (or spin-|'s). 

Our paradigm for understanding decoherence rests on the idea that environments de- 
cohere the system by measuring it, and that in measuring the system the environments 
come to have information about the system. Of the several quantities we could choose to 
measure how much information an environment £ has about the system, we choose quan- 
tum mutual information (QMI), which is the amount of entropy produced by destroying 
correlations between S and £: 

1 S . £ = H(S) + H(£)-H(S£) (1) 

While other appealing measures of information exist the QMI is mathematically elegant 
and generally easy to compute. We note here a few properties of X for future reference. 

• Is.s — if and only if p S£ = ps ® Pe (i- e - ; the subsystems are uncorrelated) . 

• When p S£ is a pure state, X s . £ = 2H(S), since H{£) = H(S) and H{SS) = 0. 

• When pse is a pure state, and £ is partitioned into £\ and £2, Is-.d +1s-£ 2 — ^s-.e- 
This follows simply from expanding X using Eq. ^ and the symmetry of bipartite 
entanglement for pure states. Note that this formula is not true for mixed p$ £ , or for 
divisions of £ into more than two sub-environments! 

• For pure states of S£ , Xs-.e is also an entanglement measure. This connects our work 
with an existing body of work on multipartite entanglement, particularly in randomly 
or dynamically produced states, _ (see, e.g., 0, [l(3])> and on the "entangling power" 



of dynamical maps (see e.g. 
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Q], and refs. 24-35 in |11|. In this paper, how- 
ever, we intentionally avoid the complicated field of entanglement by treating mutual 
information only as a measure of correlation. 



Within this framework, the overall question we examine is: How do individual envi- 
ronments' information (about S) compare to the whole environment's informa- 
tion?. We'll assume that no particular environment is "special"; rather, we'll examine the 
information possessed by a typical environment or collection thereof. It should be noted 
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that this is not the same as assuming that all environments are identical; we allow environ- 
ments to have more or less information about the system, but we obtain a statistical "typical 
environment" by averaging. This assumption reduces our overall question to a more defi- 
nite one: How does the mutual information I between the system and a typical 
sub-environment £{ m } of size m depend on ml 

Our primary tool for answering this question is the partial information plot or PIP (see 
Fig. [IJ, which plots Is-.s {m} against m. Since there are many sub-environments £{ m } contain- 
ing m individual environments, the partial information plot is not a curve, but a scatter-plot 
or histogram. To obtain a smooth curve, we average over all £{ m } to obtain X(m). In later 
sections, we'll discuss the advantages and disadvantages of this averaging scheme (as op- 
posed to, e.g., taking the average of m over all £{ m } with information I). Before examining 
specific PIP's, we prove a simple but important lemma. 

Lemma 1. When p$s is a pure state, T{m) is antisymmetric around m = y. 

Proof. As noted previously, when £ is partioned into £{ m } and £{Ar_ m }, then T s , £{m} + 
1s-.s {N _ m} — Zs-.s- We obtain X{m) by averaging T s . £{m} over all sub-environments of size m, 
and for each £{ m } there exists a unique complement £{N- m }. Thus 



J(m) +1(N - m) = 




Since X(m) + I(N — m) = Zs-.e, 1{m) must be antisymmetric around m = y; further, 
i{^)=\T S :S = H{S). □ 

From Lemma ^ and the obvious condition that I(m) must be non-decreasing, the PIP 
can take three basic shapes (Fig. Q). The simplest profile (a) is one where I(m) oc m - 
that is, each environment provides unique and independent information, so that the total 
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FIG. 1: The three basic profiles for partial information plots (I vs. m). Curve (a) shows the 
behavior of independent environments; curve (b) shows redundantly stored information; and curve 
(c) illustrates the case where information is encoded in multiple environments. 

information gained is simply proportional to the number of environments examined. Alter- 
natively (b), information may be redundantly stored; X(m) increases rapidly at first, then 
plateaus at X ~ H(S). Finally (c), information about the system may be encoded, so that 
X(m) remains close to 0, then increases rapidly around m ~ y. As we shall see, each of 
these cases is realizeable and physically relevant. 

III. INFORMATION STORAGE FOR RANDOMLY CHOSEN STATES 

In order to have a baseline to compare other ensembles of states against, we start by 
examining the typical PIP for the ensemble of all states (the uniform ensemble). To perform 
this average requires a measure; we use the unique invariant measure induced by the Haar 
measure over the unitary group U(2 N+1 ). This simplifies the process, since averaging over 
all possible sub-environments £{ m } with m components is neatly subsumed in the average 
over all states. 

In fact, while the calculation can be done numerically, there exists an analytic solution. 
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Page Q| conjectured a formula for the mean entropy (av erag ed over Haar measure) of a 



ean entropy ( av erag ea c 
Q and others Q Q- 



subsystem, which was later proven by Sen [14j and others [la, |16[ . Page's formula for the 
mean entropy H(m, n) of an m-dimensional subsystem of an mn-dimensional system (where 
m < n) is 

ran _ 

- . . v-^ 1 m — 1 , s 

fe=n+l 

We use this formula to calculate the mean entropy -fffc ; Ar of a fc-qubit (2 fc -dimensional) sub- 
system of an iV-qubit (2 JV -dimensional) universe, obtaining an expression in terms of the 
digamma function \I/(n): 

H k , N = ^(2 N + 1) - ^(2 (7V - fe) + 1) - 2 k - N -\2 k - 1) (3) 

The average mutual information between a 1-qubit system and an m-qubit sub-environment, 
all within aniV + 1-qubit universe, is 



Zs:£ {m} — Hi-N+l + H m -M+l — H m+ i-N + i. (4) 

PIPs computed from Equation 0] are shown in Fig. [21 along with brute-force numerical 
calculations which agree extremely well with Equation 0] The plots demonstrate that for 
even a modestly large (N ~ 9) environment, the vast majority of randomly-chosen states 
are "coding states", which encode the information about the system so that it cannot be 
accessed without capturing at least ~ N/2 environments. Such states display no redundancy 
whatsoever; when three or more observers attempt to divvy up the environment so that each 
has the same amount of information, each observer learns nothing. 

Since the single-qubit system's entropy is limited to 1 bit, Ig : £ < 2; nonetheless, for envi- 
ronments larger than about 5 qubits, almost every state achieves the bound. An additional 
calculation also confirms the indication in Figs. dandEl that the slope of T(m) at m = N/2 
is almost exactly 1. Thus, for large N, almost the entire 2 bits of available information are 
gained between m = y — 1 and m — y + 1. As we shall see in the next section, these results 
don't imply that information is encoded this way in every state; rather, the vast majority 
of states are "coding states." 
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FIG. 2: Partial information plots (both X and m are measured in bits), averaged over all states, 
for environments of 2 to 9 qubits. The circles represent numerical computations, while the solid 
lines represent theory (analytically continued to non-integer m). The plots indicate that informa- 
tion about the system is encoded in entangled states of the environment; nearly N/2 individual 
environments must be captured to yield more than a pittance of information. At m = N/2, where 
I(m) climbs most rapidly, its slope is almost exactly 1. 

IV. INFORMATION STORAGE FOR PRODUCT ENVIRONMENTS 



The results of the previous section indicate that the vast majority of states are non- 
redundant coding states, in which information is recorded so that multiple observers can 
never independently measure the same system. Our everyday experience, however, contra- 
dicts this result. By intercepting a tiny fraction of the photons (or phonons) which scatter 
off (or are emitted by) another human being, we obtain a great deal of information about 
their appearance (or speech); moreover, hundreds or thousands of other human beings ob- 
tain essentially the same information! We are thus led to conjecture that the ensemble 
of system-environment states produced by natural processes is substantially different from 
the ensemble of all states. 

Physical processes that produce decoherence typically have a tensor-product structure. 
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The system interacts independently with each environment, and the environments inter- 
act only weakly with each other. We thus consider interaction Hamiltonians of the form 
H(«S, £) = ^\ H(«S, Si). If the initial states of the environment are uniformly distributed 
over the Haar measure of Tig, however, then the action of any set of unitaries will yield 
a uniformly distributed set of states. In order to get a ensemble which is not uniform, we 
consider only initial product states: \^ss) = \^s) ® <S>i l^Si)- Under these assumptions, the 
"universe" will evolve into states of the form 



where |0) and |1) are used, without loss of generality, to indicate the pointer basis of the 



and asking different questions) in [4]. Subtly different ensembles of such states can be 



feature of these states is the structure of correlations. The system is strongly entangled 
with the overall environment, and potentially with some of the sub-environments, but the 
environments are never entangled with each other. Strong classical correlations, however, 
typically exist between all subsystems. Essentially, all correlations are mediated through S. 

Such states turn out to be quite amenable to mutual information calculations. Because 
of the correlation structure, every relevant reduced density matrix is at most rank-2 - that 
is, a state of a virtual qubit. In order to obtain mutual informations, we compute the 
density matrices for (a) a reduced universe consisting of S and a set £ m = {i± . . .i m } of 
environments; (b) just S; and (c) just £ m . Using the definition 7, = (^1^), we obtain: 



*se) = a \0) s (g) \^) £i + (3 |1> 5 (g) |$> 



(5) 




constructed, depending on the measure used for \ip) s and l^}^; but the most important 







(9) 
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We characterize each density matrix of the form 

,= ( x v*a-*h | (10) 

\ y/x(l - x)Y 1 - X 

by a base purity Pq = x 2 + (1 — x) 2 and a additive decoherence factor d = —log (|7| 2 ). The 
idea is that Pq represents the susceptibility of p to decoherence in the diagonal basis, while d 
represents the amount of decoherence (d — indicates a pure state, while total decoherence 
is indicated by d — oo). Conveniently, it's also appropriate to think of d as representing 
distinguishability, since it measures how nearly orthogonal \ipi) and |^) are. In terms of Pq 
and d, the Von Neumann entropy H = — Tr(plogp) is: 

H(P , d) = In 2 - - [(1 + z) ln(l + z) + (1 - z) ln(l - *)] 
where * = ^/l - 2(1 - P ) (1 - e- d ) (11) 

The key advantage to this treatment is that the (i-factors are additive. If we define di = 
—l°g\li\ 2 , then the decoherence factors for S£ m , S m , and S are 

d S£m = Yl * ( 12 ) 
d £m = Yl d * ( 13 ) 

^ = x; * (14) 

all i 

= ^5£ m + d£ m , 

while the base purities are all the same. Thus, in order to compute the mutual information 
between S and an environment S m , we need only know Pq, d$ (a measure of the information 
lost to the full environment), and dg m (a measure of the amount of information held by £ m ). 
Given these, the mutual information is simply 

l S:£[m} = H(P , ds) + H(P , d £m ) - H(P , d s -dej, (15) 

and in cases where Pq and ds are fixed, we can simply consider I(d^ m ). For the rest of this 
paper, we'll assume that Pq = \ - that is, that the initial state has one full bit of information 
to be measured. Our investigations indicate that most of our results are almost identical for 
other values of Pq, however. 
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FIG. 3: Partial information plots (I(m)) for unimodal distributions of environment <i-factors. X(m) 
is plotted against the fraction m/N of environments captured. As the total ds increases from 2 to 
8 to 32 to 128, the plots become more sharply inflected near m = and m = N, while the central 



m = + , then jumps again to 2 bits at m = N . 

The simplest product-environment state is one where the states \ipi) and \ip[) are equally 
distinguishable for each z; that is, di = d , V i. In this case, 1s-.e {m x — Z(mdo), and no 
averaging over distinct sub-environments is necessary. In Fig. we plot such PIPs for 
various values of ds- Note that the only free parameter is the total decoherence ds- For 
small ds, the total mutual information between system and environment is substantially less 
than 2H(S), and Ts£ {m} increases almost linearly with m. As ds increases, Isx approaches 
2H(S), and the X(m) curve becomes more sharply curved, indicating increased redundancy. 
Perfect redundancy - where a sub-environment has all the classical information that the 
entire environment has - is achieved only for a GHZ-type state with ds = 00. 

Of course, we can imagine a state in which the system is fully decohered (ds = 00), 
but not all sub-environments possess perfect information. More generally, the di may be 
unequal, so that some environments have more information than others. In this case, to 
obtain a valid PIP we must actually perform an average over sub-environments. Given a 



plateau at I = 1 bit grows flatter. In the limit as ds 



00, Z(m) jumps immediately to 1 bit at 
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distribution function fi(d) for the di, we can obtain a distribution function f m (d) for the 
total decoherence of m randomly chosen environments. The PIP is then given by 

= J f m (dW)<H. (16) 

In general, both obtaining f m (d) and computing the given integral are nontrivial. We there- 
fore present two sample cases. 



Case 1: A single state 

Of particular interest is the case where the state \^f) is known and has a finite number N 
of sub-environments. In this case, the di form a finite collection, and so both f\(d) and f m (d) 
are discrete probability distributions. A general technique for large N is to treat the process 
of sub-environment selection as a random walk: fi(d) is characterized by a mean value d 
and a width Ad. The Central Limit Theorem then predicts that as m gets large, regardless 
of what fo(d) actually is, f m (d) is very well approximated by a Gaussian distribution. In a 
conventional random walk, d m is simply md, and Ad m = ^/mAd. Our example is slightly 
different, in that there exists a finite collection of di, and when m = N, each of them must 
have been picked. A good analogy is a random walk which must, after N steps, arrive at a 
known endpoint. Thus, while d m still increases linearly with m, the width of f m (d) increases 
to a maximum at m — N/2, then falls to again at m — N. The precise result is: 



d m = md (17) 




' m — 1 

Ad-r, = 4 / m 1 



)Ad (18) 



N — 1 J 

Using this ansatz to approximate f m (d) is quite effective when N is large; unfortunately, 
integrating Equation is then analytically intractable, so its usefulness is restricted to 
numerical computations. 

A simple example of this sort of analysis which is tractable occurs when there are only 
two kinds of environments: "good" environments with d = do, and "useless" environments 
with d = 0. Such an environment can be characterized by three quantities: the total number 
N of environments; the number n u of useful environments, and the distinguishability do of 
each useful environment. When m environments are randomly selected, the probability of 
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FIG. 4: Partial information plots (Z(m)) for bimodal distributions of environment (i-factors. The 
red curve shows I{m) for an environment where half of the N = 32 environments have d = 1.5 
and the other half have d = 0. The blue curve shows I{m) when 6 environments have d = 3 and 
the rest are useless (d = 0). The corresponding black curves demonstrate what happens when the 
equivalent amounts of ds are spread evenly over all 32 environments - respectively, di = § and 
di = In both cases, there is a smoothing effect, but the smoothing is most pronounced when 
ds is concentrated in just a few environments. 

choosing m u G [max(0, m + n u — N) . . . min(m, n u )] useful environments can be computed 
with elementary combinatorics: 



Fig. 0] compares exact X(m) plots with the naive approximation T{m) = I(n u do/N), 
which achieves the same ds with indistinguishable environments. In all cases, the bimodal 
distribution produces a smoother PIP, which rises less quickly and has less of a "plateau" 
in the middle. This is a simple consequence of the averaging procedure, which effectively 
convolves 1{d) with a Gaussian, and thus produces a version of the original curve which is 
greatly smoothed around m = N/2, and relatively less smoothed near the ends. 




(19) 



(20) 



We integrate Equation HH1 to obtain 




(21) 
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Case 2: An ensemble of states 



In Section IIII1 we obtained an average PIP for randomly chosen states. For comparison, 
we'll now obtain an average PIP for all the product-environment states in Equation |5J 
We select the and randomly from the Haar measure for a single qubit, which is 
uniform over the Bloch sphere. A simple calculation shows that ji = |(^i|^)| 2 is uniformly 
distributed between and 1, which yields a perfect Poisson distribution for di and d m : 

fo(d) = e~ d (22) 

jm-l p -d 

^ = T^W < 23 > 



Thus armed, we proceed as in Section llTfl splitting Is-.s {m} into its three component entropies 
H(S), H(£{ m }), and H(S£{ m y), and averaging them independently. Computing the average 
entropy of a qubit that has been decohered by m environments turns out to involve a 
horrendous integral which yields infinite sums of Riemann zeta functions CO)- In the interest 
of brevity, we provide only the result here. 

H m = J f m (x)Z(x)dx 

(l + e~f ) In (l + e~f ) + (l - e~§) In (l - e~§) 



| /"OO 

' dxx m -\ 



m - 1)1 



ln(2) 



2 



-3^- J ( ln ( 2 ) - !) + y - 2 E ( ^rrq ) CO') (24) 

Fig. El shows averaged product-environment PIPs for various N. These plots contrast 
sharply with those in Fig. instead of hovering near 1=0, then shooting up to I = Xs-e 
at m = N/2, they rise rapidly to I ~ H(S), then plateau until nearly m = N. For the 
typical product-environment state, a small fraction of the environment yields a great deal 
of information about the system. The typical random state, on the other hand, encodes the 
information so that even a large minority of the sub-environments yields nothing. Fig. EJalso 
shows that, while typical product-environment states are slightly less generous with their 
information than the simple unimodal states discussed earlier, the difference is relatively 
small. We conclude, then, that product-environment states as a class display a much higher 
level of redundancy in information storage than typical unstructured states. 
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FIG. 5: Partial information plots (T(m)) averaged over the ensemble of all product-environment 
states. The four plots represent different numbers N of environments: the red, blue, green, and 
purple plots correspond respectively to N = 4, 8, 16, 32. The associated curves in black show the 
behavior of unimodal distributions with the same average d m = m. While the ensemble averages 
are (as expected) less inflected than the unimodal plots, redundant information storage is still 
quite evident. The ensemble averages are less smoothed than those obtained from the bimodal 
distribution (Fig. QJ, since ensemble average yields a relatively sharply-peaked Poisson distribution 
for f m (d). NOTE: all plots have been rescaled on the vertical axis byZ(N) for ease of comparison. 

V. REDUNDANCY 



The notion of redundancy has been mentioned, but so far not addressed in sufficient 
detail. Redundancy was introduced in this context in fl Q, and developed into a more 
precise tool in [4], where the information about S was considered i?-fold redundant if and 
only if the environment could be partitioned into R sub-environments, each of which had 
nearly full information ( "nearly full" meaning that it could supply a fraction 1 — 5 of the 
total information). Reference 4( defines such "^-redundancy" using Shannon mutual infor- 
mation between observables of the system S and subsystems of the environment. We follow 
the idea of asking about "nearly all" information, but depart from [4| by using quantum 
mutual information as our metric (thus choosing the other option from the two possibilities 
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considered in 




Using the quantum mutual information forces us to change our definition of "nearly full 
information." The metric of information used in measure classical mutual information - 
information about a single observable. Since it's possible for each of many environments to 
have the same amount of information about an observable, "nearly full" in the context of ^| 
means T su b environment > (l—5)l to tai- The quantum mutual information, by contrast, measures 
information about all possible observables - qubits A and B can have two bits of quantum 
mutual information if they are entangled, instead of just one. Since we find in general that, 
by this metric, the most information about S that can be shared among many environments 
is \T t otai (see Fig. Q), we redefine "nearly full information" as T suhenvironment > §(1 - S)X totol . 

We note, however, that if 2 environments each have equal information about different 
parts of the system (this is only possible with systems larger than a single qubit), then each 
has \lg£ mutual information with the system, yet there is no redundancy at all. In fact, 
in such a case it's not possible to tell for certain whether the stored information is 1-fold 
redundant (i.e., not redundant at all), or 2-fold redundant. In general, if R sub-environments 
have \lse information, then the actual redundancy of the information could be anywhere 
between R — 1 and R. For this reason, we take the lower bound, and define redundancy as 
follows: 

Definition 1. When the full environment S has T$e mutual information with the system, 
and can be partitioned into at least R$ + 1 sub- environments Si such that each Si has at 
least ^-Tse mutual information with the system, then the information is stored with at least 
Rs-fold redundancy, and we say that R$ is the redundancy (with fudge- factor 5). 

VI. IMPLICATIONS 

The main implication of the preceding analysis is clear and simple: arbitrary states of 
a many-part environment encode information in entangled modes of the parts, whereas 
product-environment states of the type likely to be produced by decoherence store the in- 
formation redundantly. However, a number of more subtle implications deserve attention. 
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A. Redundancy and Z(m) 



The relationship between redundancy and the X(m) plot requires some elucidation. If we 
consider Definition^ it's apparent that R$ can be calculated for product-environment states 
with very little trouble. For a particular d$, let d r be the minimum value of dg {m} such that 
I(ds, d£ {m y) = =^Isb- Then any sub-environment £{ m ] with dg {m} > d r has at least ^-^Tgg 
mutual information with the system. Assuming infinite divisibility of the environment, the 
number of such sub-environments that can be created by a partition of £ is precisely dg/d r , 
so the redundancy can be calculated exactly as 

R = d j- - 1 (25) 

The assumption of infinite divisibility is not really justified, but (a) optimizing the partition 
exactly is a very tedious combinatoric problem with no closed-form solution, and (b) the 
R obtained this way is not only a strict upper bound on the exact R, but also more useful 
for some purposes. For instance, when dg = 10, d r = 1, and each of 11 environments has 
d{ — jj, the strict definition of R would force us to partition E into 5 sub-environments, each 
of which has almost twice the required d. The assumption of infinite divisibility essentially 
allows us to partition 8 into 10 sub-environments containing 1.1 environments each, which 
(while physically absurd) better reflects the fact that the information is much more than 
4-fold redundant. 

The simplicity of this analysis begs the question: why examine Z(m) in the first place? 
Not only does X(m) seem to be irrelevant to redundancy, but in some cases (see the small- iV 
plots in Fig. EJ) it can be actively misleading! The answer is that Z(m) can be calculated and 
analyzed for a wide range of states, whereas the redundancy calculation just given is only 
simple for the sub-ensemble of product-environment states. Additionally, X(m) provides 
a good intuition about the quantitative nature of a given state or ensemble of states; is 
information generally encoded in entangled states, stored redundantly, or independently 
distributed among environments? 

B. Perfectly redundant and perfectly encoding states 

As noted previously, the primary intuition that we obtain from the Z(m) plots is that 
most states are "coding" states, but an important sub-ensemble of states are "redundant" 
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states. We are naturally led to ask whether "perfect" examples of each type of state exist - 
that is, a state that encodes information more redundantly than any other state, or a state 
that hides the encoded information better than any other state. 

The answer is somewhat surprising: whereas perfectly redundant states exist for any N, 
perfect coding states apparently exist only for certain N. The perfectly redundant states 
are easy to understand; they are the generalized GHZ (and GHZ-like) states of the form: 

\*se) = a \0) s (g) |0) £j + p \l) s (g) (26) 

i i 

A true GHZ state is invariant under interchange of any two subsystems; however, since 
mutual information is invariant under local unitaries, we only require that the states 1 0)^-. 
and be orthogonal. Clearly, such states exist for all N. Any sub-environment with 
< m < N has exactly H(S) information, but only by capturing the entire environment 
(m = N) can we obtain the full I = 2H(S). Thus, the information is stored with iV-fold 
redundancy. Note that this sort of state is slightly irregular when analyzed as in Section HVl 
since di — oo. 

A perfect coding state, on the other hand, would be one where X(m) = for any m < N/2, 
and T(m) = Xse for m > N/2. An equivalent condition is the existence of two orthogonal 
states of N qubits, each of which is maximally entangled under all possible bipartite divisions. 
If such pairs of states exist, then the system states 10) and 11) can be correlated with them 

n 

to produce the perfect coding state. It is known (as detailed in |ll[) that such states only 
exist for N = 2,3,5,6, and possibly for N = 7 (for N = Q, only a single state exists jl?|). 
Thus, while for large N almost every state is an excellent coding state, perfect examples 
seem not to exist except for N = 2, 3, 5, (7?)! 

C. Why I(m) can be misleading 

As noted in Sections IVI Al and lIV[ plots of X(m) can sometimes be confusing or misleading. 
An excellent example of this is the case of a product-environment state with a simple bimodal 
distribution of d: out of iV environments, n u have d = oo and N — n u have d = 0. This 
is a GHZ state with n u environments, onto which N — n u totally useless ancillas have been 
tacked. As such, we note immediately that the information is stored with exactly n u -fold 
redundancy, since any partition of the iV environments into n u sub-environments such that 
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FIG. 6: The effect of diluting the environment. T(m) is plotted for environments containing 
N = 8,9, 12, 16,64 qubits; in each case, n u = 8 of the qubit environments are "useful" (d = oo), 
while the other N — 8 are useless (d = 0). All five environments are the same from a redundancy 
perspective, since the extra d = qubits are irrelevant. The I{m) curves, however, are dispro- 
portionately distorted by the sub-environments which contain no useful information. This dilution 
effect demonstrates the hazards of inferring redundancy properties from I(m) . 

each sub-environment contains one useful environment will satisfy the condition. Naively, 
we might expect that the corresponding X{m) plot would reflect this fact, and appear like a 
scaled version of a GHZ-state plot. 

However, as shown in Fig. El diluting the environment in this way has a more dramatic 
effect. Compared to the scaled GHZ-state plot, the X{m) curves (shown for varying N, with 
n u held fixed) have been smoothed. For sufficiently large N, the plots appear to indicate very 
little redundancy - yet by construction, each environment has exactly the same redundancy. 

The explanation is inherent in the well-known fact that f(x) ^ f(x). If N = 8 and n u = 4, 
then on average only m = 2 environments must be captured in order to obtain one useful 
environment (so that X = H(S)). However, the probabilities of capturing (respectively) 
and 2 useful environments do not cancel each other, since an additional useful environment 
adds nothing to X, but failure to capture any useful environments reduces X to 0. Thus, the 
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unlikely event of getting no information at all is disproportionately reflected in the average 
over possible sub-environments. The diluted GHZ state is the most dramatic example of this 
phenomenon, but it occurs in every X(m) plot except the unimodal distribution discussed 
before. Thus, while X(m) is very useful for qualitative analysis of redundancy, it's not 
quantitatively reliable. 



VII. CONCLUSIONS 



This paper introduces a program of analyzing how information about a system is stored 
in its environment, along with some of the first insights into typical results. While we 
motivate our discussion by discussing decoherence, our results are not specific to any model 
of decoherence, or even to decoherence processes in general. However, the most important 
conclusion that we've presented is that most states store information nonredundantly, while 
a certain subclass of states store information redundantly. The evidence of the world around 
us indicates unambiguously that information about macroscopic objects is highly redundant 
in their environments, so we conclude that (a) natural processes do not produce randomly 
chosen states, and (b) natural processes may favor states of the product-environment type. 

However, this is still only a conjecture 2, m Such states are favored by the ultra-simplified 
models of quantum measurement used for toy models, but in reality both measurement and 
decoherence are far more complex. We do not believe that all physical processes lead exclu- 
sively to states of the form given in Equation [SJ but perhaps when decoherence dominates, 
one obtains some similar ensemble with the same properties. The obvious next step is to 
analyze actual models of decoherence and quantum measurement, identify the ensembles 
of states produced by them, and evaluate the redundancy of information storage in those 
states. We expect that this program of research will sharpen our understanding both of 
decoherence processes and of the classicality which they induce in the universe. 
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